Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 23 |
| Number of observations | 3678 |
| Missing cells | 6712 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.3 MiB |
| Average record size in memory | 668.1 B |
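The missing-cell percentage follows from the counts above, since the number of cells is variables times observations. A minimal sketch of that arithmetic (the function name `missing_cell_pct` is hypothetical, not part of the profiling tool):

```python
def missing_cell_pct(n_missing: int, n_vars: int, n_obs: int) -> float:
    """Missing cells as a percentage of all cells (variables x observations)."""
    total_cells = n_vars * n_obs
    return 100.0 * n_missing / total_cells


# Figures copied from the Dataset statistics table.
pct = missing_cell_pct(n_missing=6712, n_vars=23, n_obs=3678)
print(f"{pct:.1f}%")
```

With 23 × 3678 = 84594 cells, 6712 missing cells round to the 7.9% reported in the table.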
Variable types
| Type | Count |
|---|---|
| Categorical | 13 |
| Numeric | 10 |
Alerts
| Alert | Type |
|---|---|
| society has a high cardinality: 676 distinct values | High cardinality |
| sector has a high cardinality: 115 distinct values | High cardinality |
| areaWithType has a high cardinality: 2355 distinct values | High cardinality |
| price is highly overall correlated with price_per_sqft and 7 other fields | High correlation |
| price_per_sqft is highly overall correlated with price | High correlation |
| area is highly overall correlated with price and 5 other fields | High correlation |
| bedRoom is highly overall correlated with price and 5 other fields | High correlation |
| bathroom is highly overall correlated with price and 5 other fields | High correlation |
| super_built_up_area is highly overall correlated with price and 7 other fields | High correlation |
| built_up_area is highly overall correlated with price and 4 other fields | High correlation |
| carpet_area is highly overall correlated with price and 5 other fields | High correlation |
| property_type is highly overall correlated with price and 2 other fields | High correlation |
| facing is highly overall correlated with built_up_area | High correlation |
| servant room is highly overall correlated with bathroom and 1 other field | High correlation |
| store room is highly imbalanced (55.7%) | Imbalance |
| facing has 1045 (28.4%) missing values | Missing |
| super_built_up_area has 1803 (49.0%) missing values | Missing |
| built_up_area has 1988 (54.1%) missing values | Missing |
| carpet_area has 1805 (49.1%) missing values | Missing |
| area is highly skewed (γ1 = 29.73500562) | Skewed |
| built_up_area is highly skewed (γ1 = 40.70657243) | Skewed |
| carpet_area is highly skewed (γ1 = 24.33967469) | Skewed |
| floorNum has 129 (3.5%) zeros | Zeros |
| luxury_score has 462 (12.6%) zeros | Zeros |
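The skew alerts report γ1, the skewness coefficient. As a rough illustration of what that number measures, here is a sketch of the plain moment-based form g1 = m3 / m2^1.5. This is an assumption about the formula: pandas applies a small-sample bias correction, so its values differ slightly. The toy data below is invented and is not taken from the dataset.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no small-sample correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


symmetric = [1, 2, 3, 4, 5]
# Hypothetical areas: one extreme value stretches the right tail.
right_tailed = [1200, 1500, 1700, 1800, 2300, 875000]

print(skewness(symmetric))      # zero for symmetric data
print(skewness(right_tailed))   # positive: long right tail
```

A single extreme value is enough to push the coefficient well above zero, which is consistent with the very large maxima listed for the skewed area columns.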
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2024-09-13 17:24:22.262227 |
| Analysis finished | 2024-09-13 17:24:55.330393 |
| Duration | 33.07 seconds |
| Software version | pandas-profiling v3.6.3 |
| Download configuration | config.json |
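The reported duration can be cross-checked from the two timestamps. A small sketch using only the standard library (the format string is an assumption that matches the timestamps as printed):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps above

started = datetime.strptime("2024-09-13 17:24:22.262227", FMT)
finished = datetime.strptime("2024-09-13 17:24:55.330393", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference is 33.068166 seconds, which rounds to the 33.07 seconds shown.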
property_type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 248.7 KiB |
| flat | 2819 |
|---|---|
| house | 859 |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.2335508 |
| Min length | 4 |
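These length statistics can be reproduced from the value counts alone. A minimal sketch, assuming the counts from the Common Values table (2819 `flat`, 859 `house`):

```python
from statistics import median

# Value counts copied from the Common Values table for property_type.
counts = {"flat": 2819, "house": 859}

# One string length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

total_chars = sum(lengths)
mean_length = total_chars / len(lengths)

print(min(lengths), max(lengths), median(lengths))
print(total_chars, round(mean_length, 7))
```

The total of 15571 characters and the mean of about 4.2335508 agree with the figures in the Length and Characters and Unicode tables.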
Characters and Unicode
| Total characters | 15571 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | house |
|---|---|
| 2nd row | flat |
| 3rd row | house |
| 4th row | flat |
| 5th row | flat |
Common Values
| Value | Count | Frequency (%) |
| flat | 2819 | 76.6% |
| house | 859 | 23.4% |
Words
| Value | Count | Frequency (%) |
| flat | 2819 | 76.6% |
| house | 859 | 23.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15571 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15571 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15571 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
society
Categorical
| Distinct | 676 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 294.0 KiB |
| independent | 486 |
|---|---|
| tulip violet | 75 |
| ss the leaf | 73 |
| shapoorji pallonji joyville gurugram | 42 |
| dlf new town heights | 42 |
| Other values (671) | 2959 |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 16.870003 |
| Min length | 1 |
Characters and Unicode
| Total characters | 62031 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 307 |
|---|---|
| Unique (%) | 8.3% |
Sample
| 1st row | arjun marg/ sector- 26 phase- 1/ golf course road |
|---|---|
| 2nd row | pivotal riddhi siddhi |
| 3rd row | unitech espace |
| 4th row | godrej nature plus |
| 5th row | godrej nature plus |
Common Values
| Value | Count | Frequency (%) |
| independent | 486 | 13.2% |
| tulip violet | 75 | 2.0% |
| ss the leaf | 73 | 2.0% |
| shapoorji pallonji joyville gurugram | 42 | 1.1% |
| dlf new town heights | 42 | 1.1% |
| signature global park | 35 | 1.0% |
| shree vardhman victoria | 34 | 0.9% |
| smart world orchard | 32 | 0.9% |
| emaar mgf emerald floors premier | 32 | 0.9% |
| dlf the ultima | 31 | 0.8% |
| Other values (666) | 2795 | 76.0% |
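With 676 distinct societies, 307 of which occur only once, one common way to tame the cardinality is to fold rare values into a single bucket. The sketch below is an illustration under assumptions, not something the report prescribes: the helper `group_rare`, the `"other"` label, and the threshold are all hypothetical, and the toy list stands in for the real column.

```python
from collections import Counter


def group_rare(values, min_count):
    """Replace values seen fewer than min_count times with the label 'other'."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else "other" for v in values]


# Toy sample using society names from the tables above (counts are invented).
toy = (
    ["independent"] * 5
    + ["tulip violet"] * 3
    + ["unitech espace", "godrej nature plus"]
)

grouped = group_rare(toy, min_count=3)
print(Counter(grouped))
```

The threshold trades detail for stability: a higher `min_count` leaves fewer categories but hides more of the long tail.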
Words
| Value | Count | Frequency (%) |
| independent | 491 | 5.1% |
| the | 350 | 3.6% |
| dlf | 220 | 2.3% |
| park | 209 | 2.2% |
| city | 166 | 1.7% |
| emaar | 155 | 1.6% |
| global | 153 | 1.6% |
| m3m | 152 | 1.6% |
| signature | 150 | 1.5% |
| heights | 134 | 1.4% |
| Other values (783) | 7500 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6710 | 10.8% |
| (space) | 6005 | 9.7% |
| a | 5861 | 9.4% |
| r | 4173 | 6.7% |
| n | 4163 | 6.7% |
| i | 3831 | 6.2% |
| t | 3719 | 6.0% |
| s | 3473 | 5.6% |
| l | 2946 | 4.7% |
| o | 2758 | 4.4% |
| Other values (31) | 18392 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 55481 | |
| Space Separator | 6005 | 9.7% |
| Decimal Number | 527 | 0.8% |
| Other Punctuation | 10 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6710 | |
| a | 5861 | 10.6% |
| r | 4173 | 7.5% |
| n | 4163 | 7.5% |
| i | 3831 | 6.9% |
| t | 3719 | 6.7% |
| s | 3473 | 6.3% |
| l | 2946 | 5.3% |
| o | 2758 | 5.0% |
| d | 2489 | 4.5% |
| Other values (16) | 15358 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 207 | |
| 2 | 82 | 15.6% |
| 1 | 75 | 14.2% |
| 6 | 56 | 10.6% |
| 8 | 32 | 6.1% |
| 4 | 19 | 3.6% |
| 5 | 17 | 3.2% |
| 9 | 13 | 2.5% |
| 0 | 13 | 2.5% |
| 7 | 13 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| / | 2 | 20.0% |
| . | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6005 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55481 | |
| Common | 6550 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6710 | |
| a | 5861 | 10.6% |
| r | 4173 | 7.5% |
| n | 4163 | 7.5% |
| i | 3831 | 6.9% |
| t | 3719 | 6.7% |
| s | 3473 | 6.3% |
| l | 2946 | 5.3% |
| o | 2758 | 5.0% |
| d | 2489 | 4.5% |
| Other values (16) | 15358 |
Common
| Value | Count | Frequency (%) |
| (space) | 6005 | 91.7% |
| 3 | 207 | 3.2% |
| 2 | 82 | 1.3% |
| 1 | 75 | 1.1% |
| 6 | 56 | 0.9% |
| 8 | 32 | 0.5% |
| 4 | 19 | 0.3% |
| 5 | 17 | 0.3% |
| 9 | 13 | 0.2% |
| 0 | 13 | 0.2% |
| Other values (5) | 31 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62031 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6710 | 10.8% |
| (space) | 6005 | 9.7% |
| a | 5861 | 9.4% |
| r | 4173 | 6.7% |
| n | 4163 | 6.7% |
| i | 3831 | 6.2% |
| t | 3719 | 6.0% |
| s | 3473 | 5.6% |
| l | 2946 | 4.7% |
| o | 2758 | 4.4% |
| Other values (31) | 18392 |
sector
Categorical
| Distinct | 115 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 266.9 KiB |
| sohna road | 153 |
|---|---|
| sector 85 | 108 |
| sector 102 | 107 |
| sector 92 | 99 |
| sector 69 | 93 |
| Other values (110) | 3118 |
Length
| Max length | 26 |
|---|---|
| Median length | 9 |
| Mean length | 9.3175639 |
| Min length | 3 |
Characters and Unicode
| Total characters | 34270 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | sector 26 |
|---|---|
| 2nd row | sector 99 |
| 3rd row | sector 50 |
| 4th row | sector 33 |
| 5th row | sector 33 |
Common Values
| Value | Count | Frequency (%) |
| sohna road | 153 | 4.2% |
| sector 85 | 108 | 2.9% |
| sector 102 | 107 | 2.9% |
| sector 92 | 99 | 2.7% |
| sector 69 | 93 | 2.5% |
| sector 90 | 89 | 2.4% |
| sector 81 | 87 | 2.4% |
| sector 65 | 87 | 2.4% |
| sector 109 | 85 | 2.3% |
| sector 79 | 76 | 2.1% |
| Other values (105) | 2694 |
Words
| Value | Count | Frequency (%) |
| sector | 3450 | |
| road | 177 | 2.4% |
| sohna | 165 | 2.2% |
| 85 | 108 | 1.5% |
| 102 | 107 | 1.4% |
| 92 | 99 | 1.3% |
| 69 | 93 | 1.3% |
| 90 | 89 | 1.2% |
| 81 | 87 | 1.2% |
| 65 | 87 | 1.2% |
| Other values (107) | 2923 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3803 | |
| (space) | 3707 | 10.8% |
| s | 3694 | |
| r | 3694 | |
| e | 3549 | |
| c | 3501 | |
| t | 3461 | |
| 1 | 1074 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6207 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23305 | |
| Decimal Number | 7258 | 21.2% |
| Space Separator | 3707 | 10.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3803 | |
| s | 3694 | |
| r | 3694 | |
| e | 3549 | |
| c | 3501 | |
| t | 3461 | |
| a | 697 | 3.0% |
| d | 248 | 1.1% |
| n | 229 | 1.0% |
| h | 202 | 0.9% |
| Other values (10) | 227 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1074 | |
| 0 | 802 | |
| 8 | 778 | |
| 9 | 762 | |
| 6 | 740 | |
| 7 | 682 | |
| 2 | 679 | |
| 3 | 666 | |
| 5 | 592 | |
| 4 | 483 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3707 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23305 | |
| Common | 10965 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3803 | |
| s | 3694 | |
| r | 3694 | |
| e | 3549 | |
| c | 3501 | |
| t | 3461 | |
| a | 697 | 3.0% |
| d | 248 | 1.1% |
| n | 229 | 1.0% |
| h | 202 | 0.9% |
| Other values (10) | 227 | 1.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 3707 | 33.8% |
| 1 | 1074 | 9.8% |
| 0 | 802 | 7.3% |
| 8 | 778 | 7.1% |
| 9 | 762 | 6.9% |
| 6 | 740 | 6.7% |
| 7 | 682 | 6.2% |
| 2 | 679 | 6.2% |
| 3 | 666 | 6.1% |
| 5 | 592 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34270 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3803 | |
| (space) | 3707 | 10.8% |
| s | 3694 | |
| r | 3694 | |
| e | 3549 | |
| c | 3501 | |
| t | 3461 | |
| 1 | 1074 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6207 |
price
Real number (ℝ)
| Distinct | 473 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5341464 |
| Minimum | 0.07 |
|---|---|
| Maximum | 31.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0.07 |
|---|---|
| 5-th percentile | 0.37 |
| Q1 | 0.95 |
| median | 1.52 |
| Q3 | 2.75 |
| 95-th percentile | 8.5 |
| Maximum | 31.5 |
| Range | 31.43 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 2.9803593 |
|---|---|
| Coefficient of variation (CV) | 1.1760801 |
| Kurtosis | 14.932733 |
| Mean | 2.5341464 |
| Median Absolute Deviation (MAD) | 0.72 |
| Skewness | 3.2787171 |
| Sum | 9277.51 |
| Variance | 8.8825413 |
| Monotonicity | Not monotonic |
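Several of the statistics above are derived from others, so they can be cross-checked with a few lines of arithmetic. A minimal sketch using the values printed for `price` (small differences in the last digits are expected because the inputs are already rounded):

```python
# Values copied from the price tables above.
minimum, maximum = 0.07, 31.5
q1, q3 = 0.95, 2.75
mean, std = 2.5341464, 2.9803593

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(round(value_range, 2), round(iqr, 2))
print(round(cv, 6), round(variance, 5))
```

A coefficient of variation above 1 means the standard deviation exceeds the mean, which fits the long right tail indicated by the skewness of about 3.28.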
| Value | Count | Frequency (%) |
| 1.25 | 80 | 2.2% |
| 1.5 | 64 | 1.7% |
| 1.2 | 64 | 1.7% |
| 0.9 | 63 | 1.7% |
| 1.1 | 62 | 1.7% |
| 1.4 | 60 | 1.6% |
| 1.3 | 57 | 1.5% |
| 2 | 52 | 1.4% |
| 0.95 | 52 | 1.4% |
| 1.6 | 48 | 1.3% |
| Other values (463) | 3059 |
| Value | Count | Frequency (%) |
| 0.07 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.19 | 1 | < 0.1% |
| 0.2 | 8 | 0.2% |
| 0.21 | 6 | 0.2% |
| 0.22 | 8 | 0.2% |
| 0.23 | 1 | < 0.1% |
| 0.24 | 6 | 0.2% |
| 0.25 | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 31.5 | 1 | < 0.1% |
| 27.5 | 1 | < 0.1% |
| 26 | 2 | 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 3 | 0.1% |
| 19.5 | 2 | 0.1% |
| 19 | 3 | 0.1% |
price_per_sqft
Real number (ℝ)
| Distinct | 2651 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13892.662 |
| Minimum | 4 |
|---|---|
| Maximum | 600000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4716 |
| Q1 | 6818 |
| median | 9020 |
| Q3 | 13878 |
| 95-th percentile | 33333 |
| Maximum | 600000 |
| Range | 599996 |
| Interquartile range (IQR) | 7060 |
Descriptive statistics
| Standard deviation | 23206.896 |
|---|---|
| Coefficient of variation (CV) | 1.6704427 |
| Kurtosis | 186.97985 |
| Mean | 13892.662 |
| Median Absolute Deviation (MAD) | 2795 |
| Skewness | 11.438752 |
| Sum | 50861036 |
| Variance | 5.3856003 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 27 | 0.7% |
| 8000 | 19 | 0.5% |
| 5000 | 17 | 0.5% |
| 12500 | 14 | 0.4% |
| 22222 | 13 | 0.4% |
| 6666 | 13 | 0.4% |
| 11111 | 13 | 0.4% |
| 8333 | 12 | 0.3% |
| 7500 | 12 | 0.3% |
| 6000 | 11 | 0.3% |
| Other values (2641) | 3510 | 95.4% |
| (Missing) | 17 | 0.5% |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 58 | 2 | 0.1% |
| 60 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 600000 | 1 | < 0.1% |
| 400000 | 1 | < 0.1% |
| 315789 | 1 | < 0.1% |
| 308333 | 1 | < 0.1% |
| 290948 | 1 | < 0.1% |
| 283333 | 1 | < 0.1% |
| 266666 | 1 | < 0.1% |
| 261194 | 1 | < 0.1% |
| 245398 | 1 | < 0.1% |
| 241666 | 1 | < 0.1% |
area
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1312 |
|---|---|
| Distinct (%) | 35.8% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2888.389 |
| Minimum | 50 |
|---|---|
| Maximum | 875000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 519 |
| Q1 | 1233 |
| median | 1733 |
| Q3 | 2300 |
| 95-th percentile | 4246 |
| Maximum | 875000 |
| Range | 874950 |
| Interquartile range (IQR) | 1067 |
Descriptive statistics
| Standard deviation | 23164.341 |
|---|---|
| Coefficient of variation (CV) | 8.0198136 |
| Kurtosis | 942.28654 |
| Mean | 2888.389 |
| Median Absolute Deviation (MAD) | 533 |
| Skewness | 29.735006 |
| Sum | 10574392 |
| Variance | 5.365867 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1650 | 54 | 1.5% |
| 1350 | 48 | 1.3% |
| 1800 | 47 | 1.3% |
| 3240 | 43 | 1.2% |
| 1950 | 43 | 1.2% |
| 2700 | 39 | 1.1% |
| 900 | 38 | 1.0% |
| 2000 | 33 | 0.9% |
| 2250 | 25 | 0.7% |
| 2400 | 23 | 0.6% |
| Other values (1302) | 3268 | 88.9% |
| Value | Count | Frequency (%) |
| 50 | 4 | 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 2 | 0.1% |
| 61 | 1 | < 0.1% |
| 67 | 2 | 0.1% |
| 70 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 875000 | 1 | < 0.1% |
| 642857 | 1 | < 0.1% |
| 620000 | 1 | < 0.1% |
| 566667 | 1 | < 0.1% |
| 215517 | 1 | < 0.1% |
| 98978 | 1 | < 0.1% |
| 82781 | 1 | < 0.1% |
| 65517 | 2 | 0.1% |
| 65261 | 1 | < 0.1% |
| 58228 | 1 | < 0.1% |
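The largest `area` values sit far from the bulk of the distribution (median 1733, Q3 2300). One conventional way to quantify that, offered here as an illustration rather than anything the report itself computes, is Tukey's rule with fences at 1.5 × IQR beyond the quartiles. The helper name `tukey_fences` is hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles copied from the area quantile table.
low, high = tukey_fences(q1=1233, q3=2300)

# Largest values listed in the extreme-values table for area.
largest = [875000, 642857, 620000, 566667, 215517,
           98978, 82781, 65517, 65261, 58228]
flagged = [v for v in largest if v > high]

print(low, high, len(flagged))
```

All ten listed maxima exceed the upper fence of 3900.5. Since the 95th percentile (4246) is also above that fence, this rule would flag more than 5% of rows, so a looser multiplier or a log scale may be a better fit for this column.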
areaWithType
Categorical
| Distinct | 2355 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 428.2 KiB |
| Plot area 360(301.01 sq.m.) | 37 |
|---|---|
| Plot area 300(250.84 sq.m.) | 26 |
| Plot area 502(419.74 sq.m.) | 19 |
| Plot area 200(167.23 sq.m.) | 19 |
| Super Built up area 1950(181.16 sq.m.)Carpet area: 1161 sq.ft. (107.86 sq.m.) | 17 |
| Other values (2350) | 3560 |
Length
| Max length | 124 |
|---|---|
| Median length | 119 |
| Mean length | 54.229201 |
| Min length | 12 |
Characters and Unicode
| Total characters | 199455 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1849 |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Plot area 1000(836.13 sq.m.) |
|---|---|
| 2nd row | Carpet area: 706 |
| 3rd row | Plot area 360(301.01 sq.m.) |
| 4th row | Built Up area: 1996 (185.43 sq.m.) |
| 5th row | Carpet area: 76.44 |
Common Values
| Value | Count | Frequency (%) |
| Plot area 360(301.01 sq.m.) | 37 | 1.0% |
| Plot area 300(250.84 sq.m.) | 26 | 0.7% |
| Plot area 502(419.74 sq.m.) | 19 | 0.5% |
| Plot area 200(167.23 sq.m.) | 19 | 0.5% |
| Super Built up area 1950(181.16 sq.m.)Carpet area: 1161 sq.ft. (107.86 sq.m.) | 17 | 0.5% |
| Super Built up area 1578(146.6 sq.m.) | 17 | 0.5% |
| Plot area 270(225.75 sq.m.) | 17 | 0.5% |
| Super Built up area 1350(125.42 sq.m.) | 15 | 0.4% |
| Super Built up area 2010(186.74 sq.m.) | 14 | 0.4% |
| Super Built up area 1650(153.29 sq.m.)Carpet area: 1022.58 sq.ft. (95 sq.m.) | 14 | 0.4% |
| Other values (2345) | 3483 | 94.7% |
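The `areaWithType` strings pack one or more labelled areas into free text, which is why the column has 2355 distinct values. A sketch of how the labelled numbers might be pulled apart with a regular expression follows. The pattern and the helper `parse_area_with_type` are assumptions based only on the sample strings shown above. The sketch extracts the leading number for each label and ignores the unit, which the strings do not always state.

```python
import re

# Longest label first so "Super Built up" is not read as "Built up".
AREA_RE = re.compile(
    r"(Super Built up|Built Up|Carpet|Plot) area:?\s*(\d+(?:\.\d+)?)",
    re.IGNORECASE,
)


def parse_area_with_type(text):
    """Map each area label in the string to the number that follows it."""
    parsed = {}
    for label, number in AREA_RE.findall(text):
        key = label.lower().replace(" ", "_")
        parsed[key] = float(number)
    return parsed


examples = [
    "Plot area 360(301.01 sq.m.)",
    "Built Up area: 1996 (185.43 sq.m.)",
    "Super Built up area 1950(181.16 sq.m.)Carpet area: 1161 sq.ft. (107.86 sq.m.)",
]
for text in examples:
    print(parse_area_with_type(text))
```

The values in parentheses (square metres) are skipped on purpose because they are not preceded by an area label.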
Words
| Value | Count | Frequency (%) |
| area | 5574 | |
| sq.m | 3656 | |
| up | 3020 | 10.0% |
| built | 2316 | 7.7% |
| super | 1875 | 6.2% |
| sq.ft | 1751 | 5.8% |
| sq.m.)carpet | 1185 | 3.9% |
| sq.m.)built | 702 | 2.3% |
| carpet | 684 | 2.3% |
| plot | 681 | 2.3% |
| Other values (2846) | 8702 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 26468 | 13.3% |
| . | 20391 | 10.2% |
| a | 13157 | 6.6% |
| r | 9458 | 4.7% |
| e | 9322 | 4.7% |
| 1 | 9206 | 4.6% |
| s | 7568 | 3.8% |
| q | 7432 | 3.7% |
| t | 7325 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82358 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82770 | |
| Decimal Number | 47142 | |
| Space Separator | 26468 | 13.3% |
| Other Punctuation | 23409 | 11.7% |
| Uppercase Letter | 8594 | 4.3% |
| Close Punctuation | 5536 | 2.8% |
| Open Punctuation | 5536 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13157 | |
| r | 9458 | |
| e | 9322 | |
| s | 7568 | |
| q | 7432 | |
| t | 7325 | |
| u | 6770 | |
| p | 6768 | |
| m | 5545 | |
| l | 3701 | 4.5% |
| Other values (5) | 5724 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9206 | |
| 0 | 6630 | |
| 2 | 5689 | |
| 5 | 4714 | |
| 3 | 3961 | |
| 4 | 3711 | |
| 6 | 3674 | 7.8% |
| 7 | 3254 | 6.9% |
| 8 | 3159 | 6.7% |
| 9 | 3144 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3020 | |
| S | 1875 | |
| C | 1873 | |
| U | 1145 | 13.3% |
| P | 681 | 7.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20391 | |
| : | 3018 | 12.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 26468 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5536 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 108091 | |
| Latin | 91364 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13157 | |
| r | 9458 | |
| e | 9322 | |
| s | 7568 | |
| q | 7432 | |
| t | 7325 | |
| u | 6770 | |
| p | 6768 | |
| m | 5545 | 6.1% |
| l | 3701 | 4.1% |
| Other values (10) | 14318 |
Common
| Value | Count | Frequency (%) |
| (space) | 26468 | 24.5% |
| . | 20391 | |
| 1 | 9206 | 8.5% |
| 0 | 6630 | 6.1% |
| 2 | 5689 | 5.3% |
| ) | 5536 | 5.1% |
| ( | 5536 | 5.1% |
| 5 | 4714 | 4.4% |
| 3 | 3961 | 3.7% |
| 4 | 3711 | 3.4% |
| Other values (5) | 16249 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199455 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 26468 | 13.3% |
| . | 20391 | 10.2% |
| a | 13157 | 6.6% |
| r | 9458 | 4.7% |
| e | 9322 | 4.7% |
| 1 | 9206 | 4.6% |
| s | 7568 | 3.8% |
| q | 7432 | 3.7% |
| t | 7325 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82358 |
bedRoom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3602501 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8974002 |
|---|---|
| Coefficient of variation (CV) | 0.5646604 |
| Kurtosis | 18.216047 |
| Mean | 3.3602501 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.4851888 |
| Sum | 12359 |
| Variance | 3.6001274 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1496 | 40.7% |
| 2 | 942 | 25.6% |
| 4 | 661 | 18.0% |
| 5 | 210 | 5.7% |
| 1 | 124 | 3.4% |
| 6 | 74 | 2.0% |
| 9 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 7 | 28 | 0.8% |
| 12 | 28 | 0.8% |
| Other values (9) | 44 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 124 | 3.4% |
| 2 | 942 | 25.6% |
| 3 | 1496 | 40.7% |
| 4 | 661 | 18.0% |
| 5 | 210 | 5.7% |
| 6 | 74 | 2.0% |
| 7 | 28 | 0.8% |
| 8 | 30 | 0.8% |
| 9 | 41 | 1.1% |
| 10 | 20 | 0.5% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | 0.1% |
| 18 | 2 | 0.1% |
| 16 | 12 | 0.3% |
| 14 | 1 | < 0.1% |
| 13 | 4 | 0.1% |
| 12 | 28 | 0.8% |
| 11 | 1 | < 0.1% |
| 10 | 20 | 0.5% |
bathroom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4249592 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9479764 |
|---|---|
| Coefficient of variation (CV) | 0.56875901 |
| Kurtosis | 17.537825 |
| Mean | 3.4249592 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.2478883 |
| Sum | 12597 |
| Variance | 3.7946121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1077 | 29.3% |
| 2 | 1047 | 28.5% |
| 4 | 820 | 22.3% |
| 5 | 295 | 8.0% |
| 1 | 156 | 4.2% |
| 6 | 117 | 3.2% |
| 9 | 41 | 1.1% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 12 | 22 | 0.6% |
| Other values (9) | 38 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 156 | 4.2% |
| 2 | 1047 | 28.5% |
| 3 | 1077 | 29.3% |
| 4 | 820 | 22.3% |
| 5 | 295 | 8.0% |
| 6 | 117 | 3.2% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 9 | 41 | 1.1% |
| 10 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 3 | 0.1% |
| 18 | 4 | 0.1% |
| 17 | 3 | 0.1% |
| 16 | 8 | 0.2% |
| 14 | 2 | 0.1% |
| 13 | 4 | 0.1% |
| 12 | 22 | 0.6% |
| 11 | 4 | 0.1% |
| 10 | 9 | 0.2% |
balcony
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 238.2 KiB |
| 3+ | 1173 |
|---|---|
| 3 | 1074 |
| 2 | 884 |
| 1 | 365 |
| 0 | 182 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.3189233 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4851 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3+ |
|---|---|
| 2nd row | 2 |
| 3rd row | 3+ |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 1173 | 31.9% |
| 3 | 1074 | 29.2% |
| 2 | 884 | 24.0% |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
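`balcony` is stored as text because of the open-ended `3+` level, but its five values have a natural order. A minimal sketch of an ordinal encoding, under the assumption that rank is all a downstream model needs (the mapping `BALCONY_ORDER` and the helper are hypothetical, and the code `4` for `3+` is a rank, not a count):

```python
# Ordered levels as they appear in the Common Values table.
BALCONY_ORDER = {"0": 0, "1": 1, "2": 2, "3": 3, "3+": 4}


def encode_balcony(value: str) -> int:
    """Map a balcony label to its rank; raises KeyError on unknown labels."""
    return BALCONY_ORDER[value]


# First five rows from the Sample table.
sample = ["3+", "2", "3+", "0", "0"]
encoded = [encode_balcony(v) for v in sample]
print(encoded)
```

Raising on unknown labels is deliberate here, so that an unexpected sixth level surfaces immediately instead of being silently coded.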
Words
| Value | Count | Frequency (%) |
| 3 | 2247 | 61.1% |
| 2 | 884 | 24.0% |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2247 | |
| + | 1173 | |
| 2 | 884 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | |
| Math Symbol | 1173 | 24.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2247 | |
| 2 | 884 | 24.0% |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1173 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4851 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2247 | |
| + | 1173 | |
| 2 | 884 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2247 | |
| + | 1173 | |
| 2 | 884 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
floorNum
Real number (ℝ)
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 19 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.7974857 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 129 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 10 |
| 95-th percentile | 18 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.0118103 |
|---|---|
| Coefficient of variation (CV) | 0.88441678 |
| Kurtosis | 4.5174311 |
| Mean | 6.7974857 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.6941339 |
| Sum | 24872 |
| Variance | 36.141864 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 498 | 13.5% |
| 2 | 493 | 13.4% |
| 1 | 351 | 9.5% |
| 4 | 317 | 8.6% |
| 8 | 195 | 5.3% |
| 6 | 183 | 5.0% |
| 10 | 179 | 4.9% |
| 7 | 176 | 4.8% |
| 5 | 169 | 4.6% |
| 9 | 161 | 4.4% |
| Other values (33) | 937 | 25.5% |
| Value | Count | Frequency (%) |
| 0 | 129 | 3.5% |
| 1 | 351 | 9.5% |
| 2 | 493 | 13.4% |
| 3 | 498 | 13.5% |
| 4 | 317 | 8.6% |
| 5 | 169 | 4.6% |
| 6 | 183 | 5.0% |
| 7 | 176 | 4.8% |
| 8 | 195 | 5.3% |
| 9 | 161 | 4.4% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 43 | 2 | 0.1% |
| 40 | 1 | < 0.1% |
| 39 | 2 | 0.1% |
| 38 | 1 | < 0.1% |
| 35 | 2 | 0.1% |
| 34 | 2 | 0.1% |
| 33 | 4 | 0.1% |
facing
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1045 |
| Missing (%) | 28.4% |
| Memory size | 225.5 KiB |
| East | 624 |
|---|---|
| North-East | 623 |
| North | 387 |
| West | 249 |
| South | 231 |
| Other values (3) | 519 |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.837068 |
| Min length | 4 |
Characters and Unicode
| Total characters | 18002 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | North-East |
|---|---|
| 2nd row | East |
| 3rd row | East |
| 4th row | North |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| East | 624 | 17.0% |
| North-East | 623 | 16.9% |
| North | 387 | 10.5% |
| West | 249 | 6.8% |
| South | 231 | 6.3% |
| North-West | 193 | 5.2% |
| South-East | 173 | 4.7% |
| South-West | 153 | 4.2% |
| (Missing) | 1045 | 28.4% |
Words
| Value | Count | Frequency (%) |
| east | 624 | 23.7% |
| north-east | 623 | 23.7% |
| north | 387 | 14.7% |
| west | 249 | 9.5% |
| south | 231 | 8.8% |
| north-west | 193 | 7.3% |
| south-east | 173 | 6.6% |
| south-west | 153 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3775 | |
| s | 2015 | |
| o | 1760 | |
| h | 1760 | |
| E | 1420 | 7.9% |
| a | 1420 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13085 | |
| Uppercase Letter | 3775 | 21.0% |
| Dash Punctuation | 1142 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3775 | |
| s | 2015 | |
| o | 1760 | |
| h | 1760 | |
| a | 1420 | 10.9% |
| r | 1203 | 9.2% |
| e | 595 | 4.5% |
| u | 557 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1420 | |
| N | 1203 | |
| W | 595 | |
| S | 557 | 14.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16860 | |
| Common | 1142 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3775 | |
| s | 2015 | |
| o | 1760 | |
| h | 1760 | |
| E | 1420 | 8.4% |
| a | 1420 | 8.4% |
| N | 1203 | 7.1% |
| r | 1203 | 7.1% |
| W | 595 | 3.5% |
| e | 595 | 3.5% |
| Other values (2) | 1114 | 6.6% |
Common
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3775 | |
| s | 2015 | |
| o | 1760 | |
| h | 1760 | |
| E | 1420 | 7.9% |
| a | 1420 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
agePossession
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 281.5 KiB |
| Relatively New | 1646 |
|---|---|
| New Property | 594 |
| Moderately Old | 563 |
| Undefined | 306 |
| Old Property | 303 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 13.385536 |
| Min length | 9 |
Characters and Unicode
| Total characters | 49232 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moderately Old |
|---|---|
| 2nd row | Relatively New |
| 3rd row | Moderately Old |
| 4th row | Undefined |
| 5th row | Under Construction |
Common Values
| Value | Count | Frequency (%) |
| Relatively New | 1646 | 44.8% |
| New Property | 594 | 16.2% |
| Moderately Old | 563 | 15.3% |
| Undefined | 306 | 8.3% |
| Old Property | 303 | 8.2% |
| Under Construction | 266 | 7.2% |
Words
| Value | Count | Frequency (%) |
| new | 2240 | 31.8% |
| relatively | 1646 | 23.3% |
| property | 897 | 12.7% |
| old | 866 | 12.3% |
| moderately | 563 | 8.0% |
| undefined | 306 | 4.3% |
| under | 266 | 3.8% |
| construction | 266 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | 9.6% |
| t | 3638 | 7.4% |
| (space) | 3372 | 6.8% |
| y | 3106 | 6.3% |
| r | 2889 | 5.9% |
| d | 2307 | 4.7% |
| N | 2240 | 4.5% |
| w | 2240 | 4.5% |
| i | 2218 | 4.5% |
| Other values (15) | 14068 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 38810 | |
| Uppercase Letter | 7050 | 14.3% |
| Space Separator | 3372 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | |
| t | 3638 | |
| y | 3106 | 8.0% |
| r | 2889 | 7.4% |
| d | 2307 | 5.9% |
| w | 2240 | 5.8% |
| i | 2218 | 5.7% |
| a | 2209 | 5.7% |
| o | 1992 | 5.1% |
| Other values (7) | 5057 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2240 | |
| R | 1646 | |
| P | 897 | |
| O | 866 | 12.3% |
| U | 572 | 8.1% |
| M | 563 | 8.0% |
| C | 266 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3372 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45860 | |
| Common | 3372 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8433 | 18.4% |
| l | 4721 | 10.3% |
| t | 3638 | 7.9% |
| y | 3106 | 6.8% |
| r | 2889 | 6.3% |
| d | 2307 | 5.0% |
| N | 2240 | 4.9% |
| w | 2240 | 4.9% |
| i | 2218 | 4.8% |
| a | 2209 | 4.8% |
| Other values (14) | 11859 | 25.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 3372 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49232 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8433 | 17.1% |
| l | 4721 | 9.6% |
| t | 3638 | 7.4% |
| (space) | 3372 | 6.8% |
| y | 3106 | 6.3% |
| r | 2889 | 5.9% |
| d | 2307 | 4.7% |
| N | 2240 | 4.5% |
| w | 2240 | 4.5% |
| i | 2218 | 4.5% |
| Other values (15) | 14068 | 28.6% |
super_built_up_area
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 593 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 1803 |
| Missing (%) | 49.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1925.2376 |
| Minimum | 89 |
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 89 |
|---|---|
| 5-th percentile | 767 |
| Q1 | 1479.5 |
| median | 1828 |
| Q3 | 2215 |
| 95-th percentile | 3185 |
| Maximum | 10000 |
| Range | 9911 |
| Interquartile range (IQR) | 735.5 |
Descriptive statistics
| Standard deviation | 764.17218 |
|---|---|
| Coefficient of variation (CV) | 0.39692356 |
| Kurtosis | 10.349191 |
| Mean | 1925.2376 |
| Median Absolute Deviation (MAD) | 372 |
| Skewness | 1.8364563 |
| Sum | 3609820.6 |
| Variance | 583959.12 |
| Monotonicity | Not monotonic |
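Several of the figures above are derived from one another, which makes them easy to cross-check. A minimal sketch, using only numbers copied from the super_built_up_area tables and the standard definitions (CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum, variance = standard deviation squared); the variable names are illustrative:

```python
import math

# Values copied from the super_built_up_area tables above.
mean = 1925.2376
std = 764.17218
minimum, maximum = 89, 10000
q1, q3 = 1479.5, 2215
n_total, n_missing = 3678, 1803

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
value_range = maximum - minimum   # range
iqr = q3 - q1                     # interquartile range
n_present = n_total - n_missing   # rows that actually carry a value
implied_sum = mean * n_present    # sum implied by mean and non-missing count

print(f"CV          = {cv:.6f}")
print(f"Variance    = {variance:.2f}")
print(f"Range       = {value_range}")
print(f"IQR         = {iqr}")
print(f"Implied sum = {implied_sum:.1f}")
```

Note that the mean and sum are taken over the 1875 non-missing rows, not over all 3678 observations.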
Common Values
| Value | Count | Frequency (%) |
| 1650 | 37 | 1.0% |
| 1950 | 37 | 1.0% |
| 2000 | 25 | 0.7% |
| 1578 | 25 | 0.7% |
| 1640 | 22 | 0.6% |
| 2150 | 22 | 0.6% |
| 1900 | 19 | 0.5% |
| 2408 | 19 | 0.5% |
| 1930 | 18 | 0.5% |
| 2812 | 17 | 0.5% |
| Other values (583) | 1634 | 44.4% |
| (Missing) | 1803 | 49.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 89 | 1 | < 0.1% |
| 145 | 1 | < 0.1% |
| 161 | 1 | < 0.1% |
| 215 | 1 | < 0.1% |
| 216 | 1 | < 0.1% |
| 325 | 1 | < 0.1% |
| 340 | 1 | < 0.1% |
| 352 | 1 | < 0.1% |
| 380 | 1 | < 0.1% |
| 406 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 10000 | 1 | < 0.1% |
| 6926 | 1 | < 0.1% |
| 6000 | 1 | < 0.1% |
| 5800 | 2 | 0.1% |
| 5514 | 1 | < 0.1% |
| 5350 | 2 | 0.1% |
| 5200 | 2 | 0.1% |
| 4890 | 1 | < 0.1% |
| 4857 | 1 | < 0.1% |
| 4848 | 2 | 0.1% |
built_up_area
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 644 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 1988 |
| Missing (%) | 54.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2379.5858 |
| Minimum | 2 |
| Maximum | 737147 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 240.45 |
| Q1 | 1100 |
| median | 1650 |
| Q3 | 2400 |
| 95-th percentile | 4691 |
| Maximum | 737147 |
| Range | 737145 |
| Interquartile range (IQR) | 1300 |
Descriptive statistics
| Standard deviation | 17942.88 |
|---|---|
| Coefficient of variation (CV) | 7.5403375 |
| Kurtosis | 1667.8704 |
| Mean | 2379.5858 |
| Median Absolute Deviation (MAD) | 650 |
| Skewness | 40.706572 |
| Sum | 4021500 |
| Variance | 3.2194695 × 10^8 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1800 | 41 | 1.1% |
| 3240 | 37 | 1.0% |
| 1900 | 34 | 0.9% |
| 1350 | 33 | 0.9% |
| 2700 | 33 | 0.9% |
| 900 | 28 | 0.8% |
| 1600 | 26 | 0.7% |
| 1300 | 24 | 0.7% |
| 2000 | 24 | 0.7% |
| 1700 | 23 | 0.6% |
| Other values (634) | 1387 | 37.7% |
| (Missing) | 1988 | 54.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 50 | 3 | 0.1% |
| 53 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 5 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 737147 | 1 | < 0.1% |
| 13500 | 1 | < 0.1% |
| 11286 | 1 | < 0.1% |
| 9500 | 1 | < 0.1% |
| 9000 | 7 | 0.2% |
| 8775 | 1 | < 0.1% |
| 8286 | 1 | < 0.1% |
| 8067.8 | 1 | < 0.1% |
| 8000 | 1 | < 0.1% |
| 7500 | 2 | 0.1% |
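The skewness of 40.7 and the maximum of 737147 against a Q3 of 2400 point to a few extreme records. One common way to screen them is Tukey's 1.5 × IQR rule. This is a sketch of that rule applied to the reported quartiles, not something the report itself computes; the choice of rule and the multiplier 1.5 are assumptions:

```python
# Quartiles and extreme values copied from the built_up_area tables above.
q1, q3 = 1100, 2400
largest = [737147, 13500, 11286, 9500, 9000, 8775, 8286, 8067.8, 8000, 7500]
smallest = [2, 14, 30, 33, 50, 53, 55, 56, 57, 60]

iqr = q3 - q1                    # 1300
lower_fence = q1 - 1.5 * iqr     # Tukey lower fence
upper_fence = q3 + 1.5 * iqr     # Tukey upper fence

# Values outside the fences are candidates for review, not automatic deletions.
flagged_high = [v for v in largest if v > upper_fence]
flagged_low = [v for v in smallest if v < lower_fence]

print(f"fences: [{lower_fence}, {upper_fence}]")
print(f"flagged high values: {len(flagged_high)} of {len(largest)}")
print(f"flagged low values:  {len(flagged_low)} of {len(smallest)}")
```

Two limits of the rule show up here. The upper fence (4350) sits below the 95th percentile (4691), so more than 5% of rows would be flagged. The lower fence is negative, so implausibly small areas such as 2 or 14 are not caught and would need a separate domain threshold.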
carpet_area
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 733 |
|---|---|
| Distinct (%) | 39.1% |
| Missing | 1805 |
| Missing (%) | 49.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2529.4843 |
| Minimum | 15 |
| Maximum | 607936 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 350 |
| Q1 | 845 |
| median | 1300 |
| Q3 | 1790 |
| 95-th percentile | 2970 |
| Maximum | 607936 |
| Range | 607921 |
| Interquartile range (IQR) | 945 |
Descriptive statistics
| Standard deviation | 22793.75 |
|---|---|
| Coefficient of variation (CV) | 9.0112242 |
| Kurtosis | 604.8596 |
| Mean | 2529.4843 |
| Median Absolute Deviation (MAD) | 475 |
| Skewness | 24.339675 |
| Sum | 4737724 |
| Variance | 5.1955503 × 10^8 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1400 | 42 | 1.1% |
| 1800 | 35 | 1.0% |
| 1600 | 35 | 1.0% |
| 1200 | 31 | 0.8% |
| 1500 | 29 | 0.8% |
| 1650 | 28 | 0.8% |
| 1350 | 27 | 0.7% |
| 1300 | 23 | 0.6% |
| 1450 | 22 | 0.6% |
| 1000 | 22 | 0.6% |
| Other values (723) | 1579 | 42.9% |
| (Missing) | 1805 | 49.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76.44 | 3 | 0.1% |
| 77.31 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 607936 | 1 | < 0.1% |
| 569243 | 1 | < 0.1% |
| 514396 | 1 | < 0.1% |
| 64529 | 1 | < 0.1% |
| 64412 | 1 | < 0.1% |
| 58141 | 1 | < 0.1% |
| 54917 | 1 | < 0.1% |
| 48811 | 1 | < 0.1% |
| 45966 | 1 | < 0.1% |
| 34401 | 1 | < 0.1% |
study room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
Words
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2972 | 80.8% |
| 1 | 706 | 19.2% |
servant room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2349 | 63.9% |
| 1 | 1329 | 36.1% |
store room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
Words
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3340 | 90.8% |
| 1 | 338 | 9.2% |
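The overview flags store room as "highly imbalanced (55.7%)". The report does not state how that score is derived. The figure is consistent with one minus the Shannon entropy of the class shares, normalized by the maximum entropy for the number of classes, so the sketch below assumes that definition. `imbalance_score` is an illustrative name, and returning 0.0 for a single-class column is a choice made here, not something taken from the report:

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy; 0.0 = perfectly balanced, 1.0 = one class only."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: a single-class column is treated as not scorable
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1 - entropy / math.log2(n_classes)


# Counts copied from the Common Values tables in this report.
store_room = imbalance_score([3340, 338])
study_room = imbalance_score([2972, 706])

print(f"store room: {store_room:.1%}")
print(f"study room: {study_room:.1%}")
```

Under this assumed definition store room reproduces the 55.7% in the alert, while study room scores lower, which would explain why only store room is flagged.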
pooja room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
Words
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3021 | 82.1% |
| 1 | 657 | 17.9% |
others
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
Words
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3273 | 89.0% |
| 1 | 405 | 11.0% |
furnishing_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 237.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
Words
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2417 | 65.7% |
| 2 | 1055 | 28.7% |
| 1 | 206 | 5.6% |
luxury_score
Real number (ℝ)
| Distinct | 161 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.517401 |
| Minimum | 0 |
| Maximum | 174 |
| Zeros | 462 |
| Zeros (%) | 12.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 31 |
| median | 59 |
| Q3 | 110 |
| 95-th percentile | 174 |
| Maximum | 174 |
| Range | 174 |
| Interquartile range (IQR) | 79 |
Descriptive statistics
| Standard deviation | 53.052563 |
|---|---|
| Coefficient of variation (CV) | 0.74181336 |
| Kurtosis | -0.87989139 |
| Mean | 71.517401 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 0.45884513 |
| Sum | 263041 |
| Variance | 2814.5745 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 462 | 12.6% |
| 49 | 348 | 9.5% |
| 174 | 195 | 5.3% |
| 44 | 60 | 1.6% |
| 38 | 55 | 1.5% |
| 165 | 55 | 1.5% |
| 72 | 52 | 1.4% |
| 60 | 47 | 1.3% |
| 37 | 45 | 1.2% |
| 42 | 45 | 1.2% |
| Other values (151) | 2314 | 62.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 462 | 12.6% |
| 5 | 6 | 0.2% |
| 6 | 6 | 0.2% |
| 7 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 9 | 9 | 0.2% |
| 12 | 6 | 0.2% |
| 13 | 10 | 0.3% |
| 14 | 12 | 0.3% |
| 15 | 43 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 174 | 195 | 5.3% |
| 169 | 1 | < 0.1% |
| 168 | 9 | 0.2% |
| 167 | 21 | 0.6% |
| 166 | 10 | 0.3% |
| 165 | 55 | 1.5% |
| 161 | 3 | 0.1% |
| 160 | 28 | 0.8% |
| 159 | 23 | 0.6% |
| 158 | 34 | 0.9% |
Correlations
| | price | price_per_sqft | area | bedRoom | bathroom | floorNum | super_built_up_area | built_up_area | carpet_area | luxury_score | property_type | balcony | facing | agePossession | study room | servant room | store room | pooja room | others | furnishing_type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| price | 1.000 | 0.744 | 0.744 | 0.681 | 0.720 | 0.001 | 0.772 | 0.605 | 0.614 | 0.215 | 0.542 | 0.136 | 0.021 | 0.102 | 0.244 | 0.369 | 0.303 | 0.335 | 0.034 | 0.175 |
| price_per_sqft | 0.744 | 1.000 | 0.207 | 0.417 | 0.411 | -0.126 | 0.287 | 0.132 | 0.137 | 0.054 | 0.201 | 0.033 | 0.000 | 0.056 | 0.030 | 0.044 | 0.000 | 0.043 | 0.036 | 0.022 |
| area | 0.744 | 0.207 | 1.000 | 0.624 | 0.687 | 0.116 | 0.948 | 0.835 | 0.801 | 0.259 | 0.028 | 0.011 | 0.022 | 0.000 | 0.018 | 0.015 | 0.039 | 0.037 | 0.042 | 0.043 |
| bedRoom | 0.681 | 0.417 | 0.624 | 1.000 | 0.862 | -0.104 | 0.800 | 0.380 | 0.569 | 0.057 | 0.595 | 0.176 | 0.032 | 0.129 | 0.155 | 0.317 | 0.223 | 0.291 | 0.080 | 0.167 |
| bathroom | 0.720 | 0.411 | 0.687 | 0.862 | 1.000 | -0.005 | 0.819 | 0.465 | 0.599 | 0.179 | 0.471 | 0.226 | 0.044 | 0.111 | 0.176 | 0.520 | 0.244 | 0.286 | 0.070 | 0.198 |
| floorNum | 0.001 | -0.126 | 0.116 | -0.104 | -0.005 | 1.000 | 0.152 | 0.091 | 0.158 | 0.232 | 0.484 | 0.079 | 0.000 | 0.125 | 0.079 | 0.083 | 0.112 | 0.103 | 0.033 | 0.017 |
| super_built_up_area | 0.772 | 0.287 | 0.948 | 0.800 | 0.819 | 0.152 | 1.000 | 0.926 | 0.894 | 0.222 | 1.000 | 0.306 | 0.000 | 0.086 | 0.121 | 0.584 | 0.046 | 0.157 | 0.084 | 0.134 |
| built_up_area | 0.605 | 0.132 | 0.835 | 0.380 | 0.465 | 0.091 | 0.926 | 1.000 | 0.969 | 0.289 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.088 |
| carpet_area | 0.614 | 0.137 | 0.801 | 0.569 | 0.599 | 0.158 | 0.894 | 0.969 | 1.000 | 0.239 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.016 | 0.000 |
| luxury_score | 0.215 | 0.054 | 0.259 | 0.057 | 0.179 | 0.232 | 0.222 | 0.289 | 0.239 | 1.000 | 0.329 | 0.223 | 0.065 | 0.255 | 0.183 | 0.347 | 0.228 | 0.189 | 0.176 | 0.244 |
| property_type | 0.542 | 0.201 | 0.028 | 0.595 | 0.471 | 0.484 | 1.000 | 0.000 | 0.000 | 0.329 | 1.000 | 0.214 | 0.094 | 0.379 | 0.127 | 0.065 | 0.241 | 0.251 | 0.026 | 0.080 |
| balcony | 0.136 | 0.033 | 0.011 | 0.176 | 0.226 | 0.079 | 0.306 | 0.000 | 0.026 | 0.223 | 0.214 | 1.000 | 0.016 | 0.274 | 0.183 | 0.441 | 0.146 | 0.197 | 0.081 | 0.178 |
| facing | 0.021 | 0.000 | 0.022 | 0.032 | 0.044 | 0.000 | 0.000 | 1.000 | 0.000 | 0.065 | 0.094 | 0.016 | 1.000 | 0.092 | 0.000 | 0.035 | 0.035 | 0.027 | 0.000 | 0.049 |
| agePossession | 0.102 | 0.056 | 0.000 | 0.129 | 0.111 | 0.125 | 0.086 | 0.000 | 0.000 | 0.255 | 0.379 | 0.274 | 0.092 | 1.000 | 0.141 | 0.286 | 0.143 | 0.186 | 0.108 | 0.214 |
| study room | 0.244 | 0.030 | 0.018 | 0.155 | 0.176 | 0.079 | 0.121 | 0.000 | 0.000 | 0.183 | 0.127 | 0.183 | 0.000 | 0.141 | 1.000 | 0.185 | 0.226 | 0.314 | 0.031 | 0.141 |
| servant room | 0.369 | 0.044 | 0.015 | 0.317 | 0.520 | 0.083 | 0.584 | 0.000 | 0.000 | 0.347 | 0.065 | 0.441 | 0.035 | 0.286 | 0.185 | 1.000 | 0.161 | 0.252 | 0.000 | 0.270 |
| store room | 0.303 | 0.000 | 0.039 | 0.223 | 0.244 | 0.112 | 0.046 | 0.000 | 0.000 | 0.228 | 0.241 | 0.146 | 0.035 | 0.143 | 0.226 | 0.161 | 1.000 | 0.305 | 0.106 | 0.157 |
| pooja room | 0.335 | 0.043 | 0.037 | 0.291 | 0.286 | 0.103 | 0.157 | 0.000 | 0.000 | 0.189 | 0.251 | 0.197 | 0.027 | 0.186 | 0.314 | 0.252 | 0.305 | 1.000 | 0.033 | 0.216 |
| others | 0.034 | 0.036 | 0.042 | 0.080 | 0.070 | 0.033 | 0.084 | 0.000 | 0.016 | 0.176 | 0.026 | 0.081 | 0.000 | 0.108 | 0.031 | 0.000 | 0.106 | 0.033 | 1.000 | 0.060 |
| furnishing_type | 0.175 | 0.022 | 0.043 | 0.167 | 0.198 | 0.017 | 0.134 | 0.088 | 0.000 | 0.244 | 0.080 | 0.178 | 0.049 | 0.214 | 0.141 | 0.270 | 0.157 | 0.216 | 0.060 | 1.000 |
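The "High correlation" alerts in the overview can be traced back to this matrix. The sketch below assumes an alert threshold of 0.5 on the absolute coefficient; the report does not state the threshold, but 0.5 reproduces the alert "price is highly overall correlated with price_per_sqft and 7 other fields". The values are copied from the price row above:

```python
# Off-diagonal entries of the price row in the correlation matrix above.
price_row = {
    "price_per_sqft": 0.744, "area": 0.744, "bedRoom": 0.681, "bathroom": 0.720,
    "floorNum": 0.001, "super_built_up_area": 0.772, "built_up_area": 0.605,
    "carpet_area": 0.614, "luxury_score": 0.215, "property_type": 0.542,
    "balcony": 0.136, "facing": 0.021, "agePossession": 0.102,
    "study room": 0.244, "servant room": 0.369, "store room": 0.303,
    "pooja room": 0.335, "others": 0.034, "furnishing_type": 0.175,
}

THRESHOLD = 0.5  # assumed alert threshold, not stated in the report

# Keep the fields whose absolute coefficient reaches the threshold, strongest first.
high = sorted(
    (name for name, r in price_row.items() if abs(r) >= THRESHOLD),
    key=lambda name: abs(price_row[name]),
    reverse=True,
)

print(f"{len(high)} fields highly correlated with price:")
for name in high:
    print(f"  {name}: {price_row[name]:.3f}")
```

The coefficients of exactly 1.000 between property_type and super_built_up_area, and between facing and built_up_area, sit alongside columns that are roughly half missing, so they are worth checking before being read as genuine relationships.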
First rows
| | property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | house | arjun marg/ sector- 26 phase- 1/ golf course road | sector 26 | 31.50 | 35000.0 | 9000.0 | Plot area 1000(836.13 sq.m.) | 7 | 9 | 3+ | 3.0 | North-East | Moderately Old | NaN | 9000.0 | NaN | 1 | 1 | 1 | 1 | 0 | 1 | 74 |
| 1 | flat | pivotal riddhi siddhi | sector 99 | 0.72 | 947.0 | 7603.0 | Carpet area: 706 | 2 | 2 | 2 | 12.0 | NaN | Relatively New | NaN | NaN | 706.00 | 0 | 0 | 1 | 0 | 0 | 0 | 31 |
| 2 | house | unitech espace | sector 50 | 10.30 | 31790.0 | 3240.0 | Plot area 360(301.01 sq.m.) | 5 | 6 | 3+ | 3.0 | East | Moderately Old | NaN | 3240.0 | NaN | 1 | 1 | 1 | 1 | 0 | 2 | 160 |
| 3 | flat | godrej nature plus | sector 33 | 1.75 | 8768.0 | 1996.0 | Built Up area: 1996 (185.43 sq.m.) | 3 | 3 | 0 | 2.0 | NaN | Undefined | NaN | 1996.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
| 4 | flat | godrej nature plus | sector 33 | 1.20 | 13369.0 | 898.0 | Carpet area: 76.44 | 2 | 2 | 0 | 11.0 | NaN | Under Construction | NaN | NaN | 76.44 | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
| 5 | house | independent | sector 43 | 15.50 | 28233.0 | 5490.0 | Plot area 610(510.04 sq.m.) | 5 | 6 | 3 | 3.0 | East | Moderately Old | NaN | 5490.0 | NaN | 1 | 1 | 1 | 1 | 0 | 0 | 76 |
| 6 | flat | breez global hill view | sohna road | 0.35 | 5319.0 | 658.0 | Built Up area: 658 (61.13 sq.m.)Carpet area: 554.17 sq.ft. (51.48 sq.m.) | 2 | 2 | 2 | 18.0 | NaN | New Property | NaN | 658.0 | 554.17 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| 7 | flat | emaar gurgaon greens | sector 102 | 1.40 | 8484.0 | 1650.0 | Super Built up area 1650(153.29 sq.m.) | 3 | 3 | 3 | 4.0 | North | Relatively New | 1650.0 | NaN | NaN | 0 | 1 | 0 | 0 | 0 | 2 | 83 |
| 8 | flat | vatika gurgaon | sector 83 | 1.15 | 5808.0 | 1980.0 | Super Built up area 1980(183.95 sq.m.)Built Up area: 1350 sq.ft. (125.42 sq.m.)Carpet area: 1308 sq.ft. (121.52 sq.m.) | 3 | 3 | 2 | 3.0 | South | Relatively New | 1980.0 | 1350.0 | 1308.00 | 1 | 1 | 1 | 1 | 0 | 2 | 165 |
| 9 | house | ss aaron ville | sector 49 | 6.50 | 18808.0 | 3456.0 | Plot area 384(321.07 sq.m.) | 5 | 5 | 2 | 2.0 | North-East | Relatively New | NaN | 3456.0 | NaN | 1 | 1 | 0 | 1 | 0 | 2 | 28 |
Last rows
| | property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3793 | house | independent | sector 26 | 14.75 | 51864.0 | 2844.0 | Plot area 316(264.22 sq.m.) | 16 | 20 | 3+ | 4.0 | East | New Property | NaN | 2844.0 | NaN | 1 | 1 | 1 | 1 | 0 | 2 | 153 |
| 3794 | house | vipul tatvam villa | sector 48 | 8.50 | 26235.0 | 3240.0 | Plot area 360(301.01 sq.m.) | 4 | 4 | 1 | NaN | NaN | Relatively New | NaN | 3240.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| 3795 | house | vipul tatvam villa | sector 48 | 6.40 | 24691.0 | 2592.0 | Plot area 288(240.8 sq.m.)Built Up area: 240 sq.yards (200.67 sq.m.)Carpet area: 200 sq.yards (167.23 sq.m.) | 3 | 4 | 3 | 2.0 | North | Relatively New | NaN | 240.0 | 200.000000 | 1 | 1 | 1 | 0 | 0 | 2 | 148 |
| 3796 | flat | railway officers rpf society | sector 9a | 1.25 | 6921.0 | 1806.0 | Carpet area: 1806 (167.78 sq.m.) | 4 | 3 | 3 | 1.0 | NaN | Old Property | NaN | NaN | 1806.000000 | 0 | 1 | 0 | 0 | 0 | 0 | 40 |
| 3797 | flat | emaar palm gardens | sector 83 | 1.80 | 9473.0 | 1900.0 | Super Built up area 1900(176.52 sq.m.) | 3 | 3 | 3+ | 9.0 | South | Relatively New | 1900.0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 55 |
| 3798 | flat | rof ananda | sector 95 | 0.32 | 5827.0 | 549.0 | Carpet area: 549.16 (51.02 sq.m.) | 2 | 2 | 1 | 9.0 | North | Relatively New | NaN | NaN | 549.174178 | 0 | 0 | 0 | 1 | 0 | 2 | 71 |
| 3799 | flat | raheja vedaanta | sector 108 | 0.95 | 5214.0 | 1822.0 | Super Built up area 1822(169.27 sq.m.) | 3 | 3 | 3 | 3.0 | NaN | Relatively New | 1822.0 | NaN | NaN | 0 | 0 | 0 | 0 | 1 | 0 | 95 |
| 3800 | flat | breez global heights | sohna road | 0.21 | 5329.0 | 394.0 | Carpet area: 394 (36.6 sq.m.) | 1 | 1 | 1 | 2.0 | NaN | Relatively New | NaN | NaN | 394.000000 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| 3801 | house | independent | sector 50 | 11.58 | 35741.0 | 3240.0 | Plot area 360(301.01 sq.m.) | 5 | 5 | 3 | 2.0 | NaN | Moderately Old | NaN | 3240.0 | NaN | 0 | 1 | 0 | 0 | 0 | 0 | 20 |
| 3802 | flat | pioneer araya | sector 62 | 8.35 | 19513.0 | 4279.0 | Super Built up area 4279(397.53 sq.m.) | 4 | 6 | 3 | 16.0 | East | Relatively New | 4279.0 | NaN | NaN | 0 | 1 | 0 | 1 | 0 | 0 | 153 |
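The sample rows suggest that areaWithType mixes units. Plot areas appear to be quoted in square yards (in row 3795, 288 × 9 = 2592 matches the area column), and that same row stores 240.0 in built_up_area although the text says "240 sq.yards". A minimal sketch of one way to normalize such strings to square feet, assuming the bracketed sq.m. figure is the most reliable value when present. `AREA_PATTERN` and `parse_areas` are hypothetical names, and the pattern only covers the label spellings visible in these sample rows:

```python
import re

SQFT_PER_SQM = 10.7639  # square feet in one square metre

# Label, raw number, optional unit text, optional bracketed sq.m. figure.
AREA_PATTERN = re.compile(
    r"(Super Built up area|Built Up area|Carpet area|Plot area):?\s*"
    r"(\d+(?:\.\d+)?)\s*"
    r"(?:sq\.\w+\.?)?\s*"
    r"(?:\((\d+(?:\.\d+)?) sq\.m\.\))?"
)


def parse_areas(area_with_type):
    """Map each area label to square feet, using the bracketed sq.m. figure when present."""
    areas = {}
    for label, raw, sqm in AREA_PATTERN.findall(area_with_type):
        if sqm:
            areas[label] = round(float(sqm) * SQFT_PER_SQM, 1)
        else:
            # No sq.m. figure available: keep the raw number, whose unit is unstated.
            areas[label] = float(raw)
    return areas


# areaWithType of row 3795 in the last-rows table above.
example = (
    "Plot area 288(240.8 sq.m.)"
    "Built Up area: 240 sq.yards (200.67 sq.m.)"
    "Carpet area: 200 sq.yards (167.23 sq.m.)"
)
parsed = parse_areas(example)
print(parsed)
print(parse_areas("Carpet area: 706"))
```

Converting from the sq.m. figure avoids having to guess whether an unlabeled number is in square feet or square yards. Rows without a bracketed figure stay ambiguous and would need a separate rule.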